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(54) Resource scheduling 

(57) A method of administering resource utilisation 
in a computer comprises running a first process (1 3 : 14, 
15) to make a reservation for access to a resource in 
dependence on a resource requirement communication 
from an application (21 ) and running a second process 
(16, 17, 19) to grant requests for access to said resource 
from said application in dependence on said reserva- 



tion. A further process (12) provides a common interface 
between the first process (13, 14, 15) for each resource 
and the application (21). The further process (12) con- 
verts high-level abstract resource requirement defini- 
tions into formats applicable to the first process (1 3, 14, 
1 5) for the resource in question. The processes are pref- 
erably implemented at methods of software objects. 



< 



CO 



Q. 

LU 




Figure 2 



PrWed by Jouve. 75001 PARIS (FR) 



3NSDOCID: <EP. 



.1037147A1J _> 



1 EP 1 037 

Description 

[0001] The present invention relates to resource 
scheduling in a computer. 

[0002] Enterprise level general-purpose operating 5 
systems, both workstation and server, are traditionally 
designed for efficient resource utilisation. Applications 
share a common set of resources, which are initially 
scheduled according to their priority and then in a fair 
weighted manner. This approach results is both an effi- 10 
cient and fair division of resources to user-level applica- 
tions. However, certain applications, in particularly mul- 
timedia processing applications, are inherently sensitive 
to resource availability. As a result, the use of priority- 
based resource federation policies often results in such *5 
applications failing to meet their processing goals. 
[0003] Conventionally, relatively complex processing 
is required of the scheduling process at the stage where 
a process is granted access to a resource. The present 
invenlion enables a reduction in the complexity of the 20 
processing at the access grant stage by providing for 
reservation of resources in advance. 
[0004] According to the present invention, there is 
provided a method of administering resource utilisation 
in a computer, the method comprising running a first 25 
procoss to make a reservation for access to a resource 
in dependence on a resource requirement communica- 
tion Irom an application process and running a second 
process to grant requests for access to said resource 
from said application process in dependence on said 30 
reservationr The reservation request may be made by 
an application per se or a sub-process or component 
thereof Furthermore, access may be granted to any 
process of an application or for a particular sub-unit of 
the application for which the reservation request was 35 
made 

[0005] Making reservations and granting access at 
application level assists in embodiments of the present 
invention intended to operate with legacy code. Howev- 
er, where all applications are written for an operating 40 
system employing a scheduling method according to the 
present invention, component or thread level reserva- 
tion and access grant is preferred. 
[0006] Preferably, a method according to the present 
invention comprises: 45 

instantiating a scheduling object having a method 
or methods for processing reservation requests for 
a plurality of resources and initiating resource spe- 
cific reservation processing; so 
instantiating a reservation object having a method 
or methods for making reservations for access to a 
resource; 

invoking a method of the scheduling object from an 
application process, said method taking a first re- ss 
source access requirement definition as a parame- 
ter; 

invoking, from said method of the scheduling object 



147 A1 2 

with a second resource access requirement defini- 
tion as a parameter, a reservation method (said first 
process) of the reservation object to make a reser- 
vation for the application process; 
running a resource specific scheduling process 
(said second process) to grant access to a resource 
in dependence on the reservation made by the res- 
ervation object; and 

utilising said resource for the purposes of said ap- 
plication process. 

[0007] Preferably the method of the scheduling ob- 
ject translates the first resource requirement definition 
into the second resource requirement definition. The ad- 
vantage of this is that the resource request can be de- 
fined by a programmer in a hardware independent man- 
ner and the scheduling object for a particular platform 
can translate that programmer-produced resource re- 
quest in a potentially non-linear manner on the basis of 
the properties of the particular platform. 
[0008] A method according to the present invention 
can be applied to the allocation of cpu time, access to 
mass storage devices, e.g. hard disk drives, optical disk 
drives and the like, and memory management. 
[0009] Preferably in the case of mass storage access, 
said first process comprises allocating one or more "lot- 
tery ticket" numbers to said application or a process 
thereof in dependence on the second resource access 
requirement definition and storing requests for access 
to a mass storage device from application processes; 

generating substantially randomly a lottery ticket 
number; and 

if no application process has been allocated said 
lottery ticket number then passing on to a mass stor- 
age device driver process the stored request for ac- 
cess from an application process selected on the 
basis of a predetermined prioritisation criterion else 
passing on to a mass storage device driver process 
a stored request for access from an application 
process to which said lottery ticket number was al- 
located. 

[0010] The majority of multi-tasking operating sys- 
tems, including Windows NT, Unix and Linux, use a 
scheduling algorithm known as Round-Robin. At the 
end of a time slot, or when a thread has revoked its time 
slice, the dispatcher takes off the first thread which is 
waiting on the highest priority queue (some queues will 
be empty). Additional techniques are used to prevent 
starvation by dynamically increasing the priority of long- 
waiting threads. 

[0011] According to the present invention, there is al- 
so provided a method of scheduling access to a cpu 
which may be implemented in a method of administering 
resource utilisation in a computer according to the 
present invention, the method comprising the steps of: 
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generating a one-dimensional reservation request 
pattern; 

merging the reservation request pattern with a one- 
dimensional cpu access control pattern, represent- 
ing empty cpu access time slots and reserved cpu 
access time slots, without substantially disturbing 
either the reservation request pattern or the re- 
served cpu access time slots in the reservation re- 
quest pattern. 

[0012] The reservation request pattern may be short- 
er than the cpu access control pattern, in which case the 
reservation request pattern is effectively extended by 
repetition in the merging process. 
[0013] Preferably, said merging step comprises relo- 
cating a non-empty time slot element of the reservation 
request pattern or the cpu access control pattern such 
that the patterns can be merged without any reserved 
cpu access time slot elements being deleted or overwrit- 
ten. More preferably, the relocated non-empty time slot 
element is relocated by an amount defined in said time 
slot element. Both forwards and backwards shifts or shilt 
limits may be included in each element. 
[0014] According to the present invention, there is 
provided a method of granting access to a cpu which 
may be implemented in a method of administering re- 
source utilisation in a computer according to the present 
invention, the method comprising: 

generating a one-dimensional cpu access control 
pattern, each element of which relates to a quantum 
of cpu access time; and 
at the end of a quantum of cpu access time: 

granting access to any pending processes hav- 
ing a priority greater than a predetermined lev- 
el; and then 

if the next pattern element is empty then grant- 
ing access to a pending process meeting a pre- 
determined prioritisation (e.g. Round-Robin) 
criterion else granting access to a process iden- 
tified in the pattern element. 

[0015] Preferably, an entry for the process in any of a 
plurality of different priority queues will be searched for 
and access will be granted in respect of the entry for the 
process having the highest priority. Alternatively, access 
is granted to the process identified in the pattern ele- 
ment when there is not a populated process queue hav- 
ing a higher priority than the queue in which said process 
is present. 

[0016] Preferably, the a one-dimensional cpu access 
control pattern is generated by a method of scheduling 
access to a cpu according to the present invention. 
[001 7] Embodiments of the present invention will now 
be described, by way of example, with reference to the 
accompanying drawings, in which:- 



Figure 1 is a block diagram of a general purpose 
computer; 

Figure 2 is a block diagram of software components 
embodied in a computer employing resource 

s scheduling according to the present invention; 

Figure 3 illustrates part of a cpu time reservation 
method according to the present invention; 
Figure 4 is a flow chart illustrating the operation of 
the cpu secondary scheduler of Figure 2; and 

to Figure 5 is a flow chart illustrating an alternative 
manner of operation of the cpu scheduler of Figure 
2. 

[0018] Referring to Figure 1 , a general purpose corn- 
's puter comprises a CPU 1, RAM 2, ROM 3, a video in- 
terface card 4, a hard disk drive controller 5, a keyboard 
interface 6 and a mouse interface 7 all interconnected 
by a combined data and address bus 8. A video monitor 
9 is connected to the output of the video interface card 
20 4. a hard disk drive 10 is connected to the hard disk 
drive controller 5. A keyboard 11 is connected to the in- 
put of the keyboard interface 6 and a mouse 12 is con- 
nected to the input of the mouse interface 7. 
[0019] The general purpose computer is operating 
25 under the control of a multi-tasking operating system. 
The operating system may be a known operating sys- 
tem such as Windows NT, Unix or Linux but with the re- 
source scheduling modified to operate as described be- 
low 

30 [0020] Referring to Figure 2, in order to provide re- 
source sharing the operating system comprises a dis- 
patcher component 1 1 , a primary scheduler component 
12, a cpu reservation component 13, a hard disk drive 
reservation component 14, a memory reservation com- 

35 ponent 15, a cpu secondary scheduler 16, a hard disk 
drive secondary scheduler 17 and a memory balance 
manager 19. 

[0021] The dispatcher component 11 maintains 
queues of threads awaiting servicing by the CPU 1 . A 

40 thread may have a priority value between 0 and 31 , 31 
being the highest priority, and the dispatcher component 
11 maintains a separate queue for each priority. Thus, 
all the threads with priority 0 are in one queue, all the 
threads with priority 1 are in another queue and so on. 

45 within each queue, new threads are added to the tail of 
the queue which operates on a FIFO principle. 
[0022] The cpu secondary scheduler 16 is responsi- 
ble for granting access to the CPU 1 to threads in a man- 
ner that will be described in more detail below. 

50 [0023] The hard disk drive secondary scheduler 17 
runs a lottery to determine access to the hard disk drive 
10. 

[0024] The primary scheduler 1 2 translates high-level 
abstract resource reservation requests, made by or oh 
55 behalf of application components 22, into a form suitable 
for use by the reservation components 13, 14, 15. The 
reservation components 13, 14, 15 attempt to reserve 
the requested resources. 
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[0025] A guaranteed service application 21 for use 
with the present embodiment, consists of a plurality of 
a components 22 that are instantiated as necessary. 
Each component 22 comprises at least one thread 23. 
The constructor 24 of each component 22 includes a 5 
procedure which calls cpu, hard disk drive and memory 
reservation methods of the primary scheduler 12. The 
component 22 indicates its level of need for a resource 
by passing the application's process ID (which is unique 
to the application 21 ), an indication of the amount of re- 10 
source required as parameters when calling the reser- 
vation methods and optionally a duration for the reser- 
vation. Alternatively, the application 21 itself or a man- 
agement entity, can make a reservation on behalf of the 
guaranteed service application 21 . is 
[0026] A best effort service application 25 consists of 
a plurality of a components 26 that are instantiated as 
necessary. The best effort service application's compo- 
nents 26 do not make resource reservations or have res- 
ervations made for them. 20 
[0027] Considering now the case of a request for cpu 
time, when the cpu reservation method of the primary 
scheduler 12 is invoked by an application component's 
constructor 24, the cpu reservation method maps a per- 
centage of resource time, received as a reservation pa- 25 
rameter, into a component "signature". The component 
signature comprises a one-dimensional array of up to 
256 elements, each element of which is a record com- 
prising fields for the application's process ID for the com- 
ponent 22, a reservation ID, the forwards flexibility and 30 
the backwards flexibility. Initially, the fields of the ele- 
ments of the array are empty. The fields of selected el- 
ements of the array are then filled using the parameters 
of the call made by the component 22. For instance, if 
the percentage value in the call is 10%, every tenth el- 35 
ement of the array is filled and if the percentage value 
us 20%, every fifth element of the array is filled in. This 
mapping generally aims to achieve the finest grained 
reservation signature possible. 

[0028] Once the component signature has been com- 40 
pleted, the cpu reservation component 1 3 is instantiated 
on demand by the primary scheduler 12. When the cpu 
reservation component 13 is instantiated, the primary 
scheduler 12 forwards the component signature, appli- 
cation and reservation Ids and any duration to the cpu 4s 
reservation component 1 3 via a method invocation. 
[0029] The cpu reservation component 13 then tries 
to merge the component signature into a copy 31 of a 
master signature 32. The component signature is repli- 
cated to fill the full lengths of the signature arrays 31 , 32 so 
used by the cpu reservation component 13 and the cpu 
secondary scheduler 1 6. For example a reservation sig- 
nature of 25% maps onto a 1 in 4 slot signature which 
is repeated 64 times. If any elements of the component 
signature clashes with an existing reservations (see Fig- ss 
ure 3), the cpu reservation component 13 tries moving 
the clashing elements of the component signature ac- 
cording to their forwards and backwards flexibility val- 



ues and moving the clashing elements of the copy 31 of 
the master signature 32, according to their backwards 
and forwards flexibilities, until the component signature 
fits into the copy 31 of the master signature 32. If this 
cannot be achieved an exception is raised and commu- 
nicated back to the component 22 via the primary sched- 
uler 1 2. If merging is achieved, the master signature 32 
is updated according to the modified copy 31 and this 
information is communicated back to the primary sched- 
uler 12. The primary scheduler 12 then returns to reser- 
vation ID to the calling component 22. 
[0030] The cpu reservation component 13 always 
maintains at least a predetermined minimum number of 
elements of the master signature 32 free for the set of 
best effort applications 25. Each element of the master 
signature 32 corresponds to one cpu access time slice, 
e.g. 20ms. The cpu reservation component 13 automat- 
ically deletes reservations, requested with a duration 
parameter, that have time-expired. 
[0031] The cpu secondary scheduler 16 uses the 
master signature 32 cyclically to allocate cpu time to 
components 22. The use of allocated cpu time by the 
threads 23 within a component 22 is the responsibility 
of the application programmer. 

[0032] Priority based scheduling is used to schedule 
threads 23 which are very high priority threads (above 
priority level 15) that are essential to the integrity of the 
system. This means that threads 23 associated with 
time-critical operating system tasks, do not have to un- 
dergo the process of reservation and admission, as de- 
scribed above. The master signature 32 is used specif- 
ically for priority levels 0-15. The reason for this ap- 
proach is to make possible support for legacy operating 
system code and services that are essential to the sta- 
bility of the system. 

[0033] The cpu secondary scheduler 16, uses the 
standard Round-Robin scheduling algorithm for threads 
with a priority greater than 15. Generally, these threads 
will not require a complete time-slice and are therefore 
negligible in terms of cpu access time. 
[0034] Referring to Figure 4, the cpu secondary 
scheduler 1 6 repeatedly performs the following process. 
The cpu secondary scheduler 1 6 determines whether a 
new thread is pending in the dispatcher 11 (step s1). If 
a new thread is pending, the cpu secondary scheduler 
16 determines whether a thread is currently running 
(step s2). If a thread is currently running, it determines 
whether the new pending thread has a higher priority 
than the running thread (in this context a guaranteed 
service thread has "priority 1 ' over a best-effort service 
thread with the same priority) (step s3). If the new 
thread's priority is higher, the cpu secondary scheduler 
16 pre-empts the running thread, dispatches the new 
thread and places the pre-empted thread at the head of 
its priority level queue in the dispatcher 11 (step s4). 
[0035] If, at step s1, a new thread is not ready or, at 
step s2, no thread is running, the cpu secondary sched- 
uler 16 determines whether the current time slice has 
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expired (step s5). If the current time-slice has not ex- 
pired then the process returns to step s1 else dispatches 
the thread (step s7) at the head of the dispatcher queue 
for the highest priority greater than 15 (step s6). 
[0036] If, at step s6, no threads with priorities greater 
than 15 are pending, the cpu secondary scheduler 16 
inspects the next element of the master signature (step 
s8). If the element is empty, the cpu secondary sched- 
uler 1 6 dispatches the thread from the front of the high- 
est priority queue that is not empty (step_s9). If the ele- 
ment is not empty, the cpu secondary scheduler 16 
looks for a thread belonging to the application, identified 
in the master signature element, in the highest priority 
populated queue in the dispatcher 11 (step s10). If one 
is found then it is dispatched (step s11 ) else the thread 
from the front of the queue is dispatched (step s1 2) and 
the cpu secondary scheduler 16 attempts to move the 
reservation (step si 3). This dynamic slot shift can only 
be made to the extent of the next slot reserved by the 
same guaranteed service application 21. The reserva- 
tion is returned to its correct position for the next cycle 
through the master signature and if the master signature 
is updated by the cpu reservation component 13. 
[0037] It should be noted that whenever a guaranteed 
service thread cannot be serviced at steps s8 and s10, 
the best-effort service application 25 has the possibility 
of having its threads 26 dispatched. 
[0038] When a component 22 of the guaranteed serv- 
ice application 21 is destroyed, its destructor 27 it may 
invoke a reservation cancel method of the primary 
scheduler 12, passing the reservation ID as a parame- 
ter. The primary scheduler 12 then invokes a cancella- 
tion method of the cpu reservation component 1 3 which 
removes the calling component's slot reservations from 
the copy 31 of the master signature 32 and then updates 
the master signature 32. 

[0039] Considering now the case of disk access, an 
IRP (I/O Request Packet) is an encapsulated request, 
which defines the processing requirements, including 
the type of operation to be carried out (read or write), 
the number of bytes to manipulate, the synchrony of the 
request etc.. 

[0040] The constructor 24 of a component 22 includes 
a procedure for registering the guaranteed service ap- 
plication 21 for disk access. This process calls a disk 
access reservation method of the primary scheduler 12 
with a demand level indicator, usually a percentile, as a 
parameter. The primary scheduler 12 then calls a disk 
access reservation method of the hard disk drive reser- 
vation component 1 4 with the demand level indicator as 
a parameter. The hard diskdrive reservation component 
14 then allocates a number of "lottery tickets" to the ap- 
plication 21 containing the component 22 in depend- 
ence on the demand level indicator. The component 22 
may request an abstract reservation in terms of a class 
identifier which the primary scheduler 12 can use to ref- 
erence a persistent lookup table mapping the appropri- 
ate demand level indicator 



[0041] When a thread 23, belonging to a guaranteed 
service application 21 , requires disk access, it sends an 
IRP to an operating systems I/O manager 20, which is 
then forwarded to the hard disk secondary scheduler 17 
s via the operating system's file system component 30. 
The operating system's file system component 30 in- 
creases the granularity of the access request, e.g. from 
a request for one large block a data to many small blocks 
of data. When the IRP reaches the hard disk secondary 

w scheduler 17, it is place in a wait array according to the 
guaranteed service application's process ID. The hard 
disk drive secondary scheduler 17 runs a lottery by ran- 
domly generating "ticket" numbers. If winning applica- 
tion 21 has an IRP in the wait array, the winning appli- 
es cation 21 is granted disk access and the longest waiting 
IRP for the winning application is passed on to a physical 
disk device driver process 32. When there is no winning 
ticket holder or the winning ticket holder does not have 
an IRP ready for servicing, then, an alternative IRP is 

20 scheduled via a simple cyclic selection scheme which 
includes IRPs from the best effort service application 25. 
[0042] When a component 22 of the guaranteed serv- 
ice application 21 is destroyed, its destructor 27 deter- 
mines whether it is the only instance of a component 22 

2S of the application 21 . If it is the last such instance, it in- 
vokes a cancel hard disk access method of the primary 
scheduler 1 2 which in turn invokes a cancellation meth- 
od of the hard disk drive reservation component 1 4. The 
hard diskdrive reservation component 14 then de-allo- 

30 cates the tickets allocated to the application 21 . 

[0043] Considering now the case of memory access, 
each application 21, 25 maintainsits own unique virtual 
address space, which is a set of memory addresses 
available for its threads to use. This address space is 

35 much larger than the RAM 2 of the computer. When a 
component 22, 26 references its application's virtual 
memory, it does so with a pointer which is used by a 
virtual memory manager to determine the location in 
RAM 2 to which the virtual address maps. However, 

40 some portions of memory may actually reside on disk 
1 0 or backing store. This means that data or code which 
is infrequently accessed is held on disk 1 0 thus retaining 
RAM 2 for better use. To facilitate this scheme, memory 
is managed in fixed size blocks known as "pages", which 

45 are usually 4Kb or 8Kb. If a component 22 references 
an address whose page is not in RAM 2, then a page 
fault occurs. This triggers a process known as "paging" 
which is the task of locating the faulted page within the 
page file and then loading the page into RAM 2. If there 

50 js insufficient space in RAM 2, then a page must be first 
swapped out and written to the page file. However, each 
process usually keeps a minimum number of pages 
locked into RAM 2 known as the "working set". 
[0044] All modern operating systems generally pro- 

55 tect memory in three forms:- physical hardware disal- 
lows any thread from accessing the virtual address 
space of other components, a distinction is made be- 
tween the mode of operation, kernel-mode (ring 0) 
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which allows threads access to system code and data, 
and user-mode (ring 3) which does not, and a page- 
based protection mechanism wherein each virtual page 
maintains a set of flags that determine the type of access 
permitted. However, these mechanisms are aimed at $ 
protecting a component's memory resources from un- 
wanted interference by other unauthorised processes in 
the system. They do not prevent a process from con- 
suming more than its reasonable share of memory. 
[0045] To aid understanding of the approach of the io 
present invention to the implementation of working set 
size control, the working set management scheme in 
Windows NT will be briefly described. When created, 
each component is assigned two thresholds, a minimum 
working-set size and maximum working-set size. The is 
minimum defines the smallest number of pages the vir- 
tual memory manager attempts to keep locked concur- 
rently in physical memory, whilst the maximum defines 
an expansion range which can be used if the component 
is causing a considerable number of page faults. If the 20 
working set is too small, then the component incurs a 
large number of paging operations and thus a substan- 
tial overhead is incurred through continuously swapping 
pages to and from the backing store. Alternatively, if the 
working set is too large, few page faults occur but the 25 
physical memory may be holding code and/or data 
which is infrequently referenced and thus overall effi- _ 
ciency is reduced. In Windows NT, the Memory Manager 
(MM) adjusts the working sets once every second, in 
response to page-in operations or when free memory 30 
drops to below a given threshold. If free memory is plen- 
tiful, the MM removes infrequently referenced pages on- 
ly from working sets of components whose current size 
is above a given minimum, this is known as aggressive 
trimming. However, if free memory is scarce, the MM 3S 
can force trimming of pages from any component until 
it creates an adequate number of pages, even beyond 
the minimum threshold. Of course, components which 
have extended working sets are trimmed in preference 
to those which have the minimum working set size. 40 
[0046] In the present embodiment, the constructor 24 
of a component 22 includes a procedure for reserving a 
working set. This procedure calls a memory reservation 
method of the primary scheduler 12 which, in turn calls 
a method of the memory reservation component 1 5. The *s 
memory reservation component 15 translates this re- 
quest into the appropriate expansion of a default mini- 
mum working set size. The minimum working set size 
regulates the number of pages an application 21 can 
concurrently lock (or pin) into RAM 2. Thus, a compo- so 
nent 22 that wishes to reserve a portion of memory, 
makes the reservation request via the primary scheduler 
12 to the memory reservation component 1 5 which then 
increases the working set thresholds accordingly. This 
adjustment is facilitated through operating system calls, ss 
It is then the responsibility of the component to pin the 
pages into RAM 2 as required. The pinning process is 
arbitrated on a first -come first-serve basis. If insufficient 



physical pages are available, the operating system will 
reject the pin request, therefore the component 22 is ex- 
pected to pin its reserved pages soon after allocation. 
[0047] If there are insufficient free physical memory 
resources, the memory reservation component 1 5 then 
attempts to force the processes of best-effort service ap- 
plications 25 to write pages back to file store until suffi- 
cient free physical pages are available. If insufficient 
space can be made, then the reservation is rejected. As 
with the other resource modules, a portion of the RAM 
2 is reserved for a minimal best-effort service. This is 
facilitated by maintaining a count of the total number of 
physical pages being used under the guaranteed serv- 
ice and checking this against a portion of the total phys- 
ical page capacity. It should also be noted that the 'best- 
effort' service is strictly a minimal guaranteed service 
(defined by the system assigned minimum working set 
size) in addition to the best-effort service through poten- 
tial expansion of the working set. It is assumed that proc- 
esses cannot bypass the primary scheduler 12 and di- 
rectly alter their own working set size. Finally, special 
consideration is made for pages which are being used 
for shared memory purposes. If any client of a shared 
page has undergone reservation, then the page is clas- 
sified as reserved even though other clients may not 
have made a reservation. 

[0048] A second embodiment is the same as the first 
embodiment described above except for the manner of 
operation of the cpu secondary scheduler 16. The op- 
eration of the cpu scheduler of the second embodiment 
will now be described. 

[0049] Referring to Figures 2 and 4, the cpu second- 
ary scheduler 16 repeatedly performs the following 
process. The cpu secondary scheduler 16 determines 
whether a new thread is pending in the dispatcher 11 
(step s1 01 ). If a new thread is pending, the cpu second- 
ary scheduler 16 determines whether a thread is cur- 
rently running (step s102). If a thread is currently run- 
ning, it determines whether the new pending thread has 
a higher priority than the running thread (in this context 
a guaranteed service thread has "priority" over a best- 
effort service thread with the same priority) (step s103). 
If the new thread's priority is higher, the cpu secondary 
scheduler 16 pre-empts the running thread, dispatches 
the new thread and places the pre-empted thread at the 
head of its priority level queue in the dispatcher 1 1 (step 
st 04). 

[0050] If, at step s101, a new thread is not ready or, 
at step s102, no thread is running, the cpu secondary 
scheduler 1 6 determines whether the current time slice 
has expired (step s1 05). If the current time-slice has not 
expired then the process returns to step s101 else dis- 
patches the thread (step s107) at the head of the dis- 
patcher queue for the highest priority greater than 15 
(step s106). 

[0051 ] If, at step s1 06, no threads with priorities great- 
er than 1 5 are pending, the cpu secondary scheduler 1 6 
inspects the next element of the master signature (step 
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s108). If the element is empty, thecpu secondary sched- 
uler 16 dispatches the thread from the front of the high- 
est priority queue that is not empty (step s109). If the 
element is not empty, the cpu secondary scheduler 16 
looks for a thread belonging to the application, identified 
in the master signature element, in a queue in the dis- 
patcher 11 (step s110), searching from the priority 15 
queue towards the lowest priority queue and from the 
head of each queue to the tail thereof. If one is found 
then it is dispatched (step s 111 ) else the thread from the 
front of the highest priority populated queue is dis- 
patched (step s112) and the cpu secondary scheduler 
16 attempts to move the reservation (step s113). This 
dynamic slot shift can only be made to the extent of the 
next slot reserved by the same guaranteed service ap- 
plication 21. The reservation is returned to its correct 
position for the next cycle through the master signature 
and if the master signature is updated by the cpu reser- 
vation component 13. 

[0052] It should be noted that whenever a guaranteed 
service thread cannot be serviced at steps s108 and 
s110 : the best -effort service application 25 has the pos-_ 
sibility of having its threads 26 dispatched. 
[0053] The present invention has been described with 
reference to applications comprising components which 
in turn comprise one or more threads. It will be appreci- 
ated that the present invention is not limited to this sit- 
uation and may be implemented in a system in which 
application programs are not built using object-oriented 
languages. 

[0054] Whilst the present invention has been de- 
scribed with reference to cpu, hard disk and memory ac- 
cess, it will be appreciated that it may be applied to the 
scheduling of access to other resources. 
[0055] It will be appreciated that the term "lottery tick- 
et" is used figuratively. 



Claims 

1. A method of administering resource utilisation in a 
computer, the method comprising running a first 
process to make a reservation for access to a re- 
source in dependence on a resource requirement 
communication from an application process and 
running a second process to grant requests for ac- 
cess to said resource from said application process 
in dependence on said reservation. 

2. A method according to claim 1 , comprising: 
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invoking a method of the scheduling object from 
an application process, said method taking a 
first resource access requirement definition as 
a parameter; 

invoking, from said method of the scheduling 
object with a second resource access require- 
ment definition as a parameter, a reservation 
method of the reservation object to make a res- 
ervation for the application process; 
running a resource specific scheduling process 
to grant access to a resource in dependence 
on the reservation made by the reservation ob- 
ject; and 

utilising said resource for the purposes of said 
application process. 

3. A method according to claim 2, wherein said meth- 
od of the scheduling object translates the first re- 
source requirement definition into the second re- 
source requirement definition. 

4. A method according to claim 1 , 2 or 3, wherein said 
resource is a cpu. 



25 s. a method according to claim 1 , 2 or 3, wherein said 
resource is a mass storage device. 

6. A method of scheduling accessto a cpu, the method 
comprising the steps of: 
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20 



30 



35 



40 



45 



SO 



generating a one-dimensional reservation re- 
quest pattern; 

merging the reservation request pattern with a 
one-dimensional cpu access control pattern, 
representing empty cpu access time slots and 
reserved cpu access time slots, without sub- 
stantially disturbing either the reservation re- 
quest pattern or the reserved cpu access time 
slots in the reservation request pattern. 

7. A method according to claim 6 : wherein said merg- 
ing step comprises relocating a non-empty time slot 
element of the reservation request pattern or the 
cpu access control pattern such that the patterns 
can be merged without any reserved cpu access 
time slot elements being deleted or overwritten. 

8. A method according to claim 7, wherein the relocat- 
ed non-empty time slot element is relocated by an 
amount defined in said time slot element. 



instantiating a scheduling object having a meth- 
od or methods for processing reservation re- 
quests for a plurality of resources and initiating 
resource specific reservation processing; 55 
instantiating a reservation object having a 
method or methods for making reservations for 
access to a resource; 



9. A method according to claim 1 , wherein said first 
process is a method according to claim 6, 7 or 8. 

10. A method of granting access to a cpu comprising: 

generating a one-dimensional cpu access con- 
trol pattern, each element of which relates to a 
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quantum of cpu access time; and 

at the end of a quantum of cpu access time: 



granting access to any pending processes 
having a priority greater than a predeter- 5 
mined level; and then 
if the next pattern element is empty then 
granting access to a pending process 
meeting a predetermined prioritisation cri- 
terion else granting access, to a process 10 
identified in the pattern element. 

11. A method according to claim 10, wherein pending 
processes populate queues having different priori- 
ties and access is granted to the process identified is 
in the pattern element when there is not a populated 
process queue having a higher priority than the 
queue in which said process is present. 



12. A method according to claim 10 or 11, wherein the 20 
a one-dimensional cpu access control pattern is 
generated by a method according to claim 6 : 7 or 8. 

13. A method according to claim 2, wherein the second 
process performs a method according to claim 10 25 
or 11. 

14. A method according to claim 1, wherein said first 
process comprises allocating one or more lottery 
ticket numbers to said application process in de- 30 
pendence on the second resource access require- 
ment definition and the second process performs a 
method comprising the steps of: 



storing requests for access to a mass storage 35 
device from application processes; 
generating substantially randomly a lottery tick- 
et number; and 

if no application process has been allocated 
said lottery ticket number then passing on to a *o 
mass storage device driver process the stored 
request for access from an application process 
selected on the basis of a predetermined prior- 
itisation criterion else passing on to a mass 
storage device driver process a stored request 45 
for access from an application process to which 
said lottery ticket number was allocated. 



15. A computer configured to operate in accordance 
with a method according to any preceding claim. so 
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12 



BNSDOCID:<EP 1037147A1 I > 



EP 1 037 147 A1 





Figure 5 



13 



3NSDOCID: <EP 1037147A1 I > 



EP 1 037 147 A1 




European Patent 
Office 



EUROPEAN SEARCH REPORT 



Application Number 

EP 99 30 3936 



o 

3 
a. 

pj 
o 

trj 
o 



2 



DOCUMENTS CONSIDERED TO BE RELEVANT 



Category 



X 
A 



X 
A 



X 
A 



Citation ol document with indication, where appropriate, 
of relevant passages 



EP 0 798 638 A (HITACHI LTD) 
1 October 1997 (1997-10-01) 

* column 8, line 22 - column 14, line 20 * 

* column 32, line 56 - column 33 , line 24 
* 

US 5 812 844 A (DRAVES JR RICHARD P ET 
AL) 22 September 1998 (1998-09-22) 

* column 6, line 56 - column 10, line 23 * 

* claims 1-3 * 

EP 0 817 041 A (SUN MICROSYSTEMS INC) 
7 January 1998 (1998-01-07) 

* column 2, line 54 - column 5, line 31 * 

* claims 1-7 * 

US 5 247 677 A ( WELLAND ROBERT V ET AL) 
21 September 1993 (1993-09-21) 

* column 5, line 24 - column 6, line 53 * 

EP 0 790 557 A (MATSUSHITA ELECTRIC IND CO 
LTD) 20 August 1997 (1997-08-20) 

* page 8, line 22 - page 13, line 38 * 

EP 0 658 841 A (IBM) 

21 June 1995 (1995-06-21) 

* column 5, line 34 - column 7, line 8 * 



Relevant 
to claim 



1,4,6,7, 
9,15 
2,3,5, 
10,12,13 



The present search report has been diawn up for all claims 



Place of search 



THE HAGUE 



1,3,4,15 

2,5-10, 
12,13 



1,3 
2,4,5 

1.14 



CLASSIFICATION OF THE 
APPLICATION (lnt.CI.7) 



G06F9/46 



1,14 



TECHNICAL FIELDS 
SEARCHED (lnt.Cl.7) 



1,10 



G06F 



Date of completion ol the search 

29 September 1999 



X 
Y 

A 

o 
p 



CATEGORY OF CITED DOCUMENTS 

particularly relevani if taken alone 
particularly relevant if combined with another 
document of the same category 
technological background 
: non-written disclosure 
intermediate document 



Examiner 



BIJn, K 



T : theory or principle underlying the invention 
E : earlier patert document, but published on, or 

after the filing date 
D : document cited in the application 
L : document ciled for other reasons 



& : member of the same patent family, corresponding 
document 



14 



1037147A1 I _> 



4 



EP 1 037 147 A1 



ANNEX TO THE EUROPEAN SEARCH REPORT 

ON EUROPEAN PATENT APPLICATION NO. EP 99 30 3936 



This annex lists the patent family members relating to the patent documents cited in the above-mentioned European search report. 
The members are as contained in the European Patent Office EDP file on 

The European Patent Office is in no way liable for these particulars which are merely given for the purpose of information. 

29-09-1999 



Patent document 




Publication 




Patent family 




Publication 


cited in search report 




date 




member(s) 




date 


EP 0798638 


A 


01-10-1997 


CA 


2200929 


A 


28-09-1997 








JP 


2904483 


B 


14-06-1999 








JP 


9319597 


A 


12-12-1997 


US 5812844 


A 


22-09-1998 


NOME 








EP 0817041 


A 


07-01-1998 


US 


5826082 


A 


20-10-1998 








JP 


10063519 


A 


06-03-1998 


US 5247677 


A 


21-09-1993 


FR 


2691557 


A 


26-11-1993 








JP 


6035726 


A 


10-02-1994 


EP 0790557 


A 


20-08-1997 


JP 


9282184 


A 


31-10-1997 


EP 0658841 


A 


21-06-1995 


US 


5487170 


A 


23-01-1996 








JP 


7200318 


A 


04-08-1995 



2 
o 

Q. 



lu For more details about this annex : see Official journal of the European Patent Office, No. 12/82 



15 



BNSDOCID: <EP 



1037147A1 I > 




THIS PAGE BLANK (uspto) 



